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ABSTRACT 

Deamination of cytosine (C), 5-methylcytosine (mC) 
and 5-hydroxymethylcytosine (hmC) occurs spontan- 
eously in mammalian DNA with several hundred 
deaminations occurring in each cell every day. The 
resulting potentially mutagenic mispairs of uracil (U), 
thymine (T) or 5-hydroxymethyluracil (hmU) with 
guanine (G) are substrates for repair by various DNA 
glycosylases. Here, we show that targeted inactivation 
of the mouse Smugl DNA glycosylase gene is suffi- 
cient to ablate nearly all hmll-DNA excision activity 
as judged by assay of tissue extracts from knockout 
mice as well as by the resistance of their embryo 
fibroblasts to 5-hydroxymethyldeoxyuridine toxicity. 
Inactivation of Smugl when combined with inactiva- 
tion of the Ung uracil-DNA glycosylase gene leads to a 
loss of nearly all detectable uracil excision activity. 
Thus, SMUG1 is the dominant glycosylase responsible 
for hmll-excision in mice as well as the major 
UNG-backup for U-excision. Both Smugf-knockout 
and Smugl /Ung-doub\e knockout mice breed 
normally and remain apparently healthy beyond 
1 year of age. However, combined deficiency in 
SMUG1 and UNG exacerbates the cancer predispos- 
ition of Msh2~ / ~ mice suggesting that when both base 
excision and mismatch repair pathways are defective, 
the mutagenic effects of spontaneous cytosine 
deamination are sufficient to increase cancer inci- 
dence but do not preclude mouse development. 

INTRODUCTION 

Uracil (U) can arise in DNA through either 
misincorporation of dUTP during DNA synthesis (giving 



rise to U:A pairs) or through deamination of cytosine (C) 
within the DNA duplex (resulting in U:G mispairs). In 
addition to C, mammalian cells also contain 5-methyl- 
cytosine (mC) and 5-hydroxymethylcytosine (hmC) (1). All 
three forms of cytosine undergo spontaneous deamination 
with an estimated 100-500 genomic C bases undergoing 
deamination per cell every day (2). Thus, if left unrepaired, 
cytosine deamination would be a major source of spontan- 
eous mutation. 

Several DNA glycosylases (UNG, SMUG1, TDG and 
MBD4) have been described in mammals to remove U or 
its derivatives from DNA (3). The UNG uracil-DNA 
glycosylase [whose ancestral counterpart in Escherichia 
coli was the first to be described (4)] is abundant, widely 
distributed and can act on U in single or double-stranded 
DNA. SMUG1, while originally named on the basis of its 
ability to excise U from single-stranded DNA ['single- 
strand-selective monofunctional uracil-DNA glycosylase'; 
(5)] is in fact much more active on U in double-stranded 
DNA, especially if mispaired with G (6-10). Thymine- 
DNA glycosylase (TDG) was named for its ability to 
excise thymine (T) from T:G mispairs (which arise 
through spontaneous deamination of mC residues that 
naturally occur in the context of mCpG dinucleotides), 
but can also act effectively on U when encountered as a 
U:G mispair as judged by assays of the recombinant 
enzyme (11). MBD4 contains both methyl-binding and 
glycosylase domains and, in in vitro assays, shows a 
similar target specificity (though lower specific activity) 
to TDG (12). 

Although C deamination has received much attention as 
a source of spontaneous DNA damage, it has more 
recently become evident that purposeful enzyme-catalysed 
deamination of C is used to trigger gene diversification in 
the immune system. The U:G mismatches generated in the 
immunoglobulin loci of activated B lymphocytes through 
the action of the enzyme activation-induced cytidine 
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deaminase (AID) are recognized by the UNG DNA- 
glycosylase as well as by the MSH2/MSH6 mismatch 
recognition protein in order to achieve somatic hyper- 
mutation of the immunoglobulin variable region and/or 
class switch recombination of the constant regions (13). 
The mammalian APOBEC3 enzymes are host restriction 
factors active against retroviruses which generate U in the 
DNA of retroviral replication intermediates through C 
deamination, although the way in which uracilation 
contributes to retroviral restriction is not fully resolved 
(14,15). 

There have been several suggestions that the DNA 
glycosylases active on U may also play a role in the 
erasure of mC from the genome that occurs during mam- 
malian development. An early suggestion was that TDG or 
MBD4 might directly excise mC from the DNA, since both 
enzymes exhibit an (albeit very weak) excision activity on 
mC:G base pairs (16-18). More recently, it has been 
proposed that mC erasure might involve either deamin- 
ation of mC to T (catalyzed by a member of the AID/ 
APOBEC family) with subsequent TDG-dependent repair 
of the resulting T:G mismatch (19-22) or alternatively that 
oxidation of mC to hmC by one of the TET family 
dioxygenases is followed by AID/APOBEC-catalysed de- 
amination to hmU and SMUG 1 -mediated excision of the 
hmU (23,24). However, definitive evidence demonstrating a 
role for a DNA glycosylase in physiological pathways of 
mC demethylation in animal cells is lacking. 

Although most of our knowledge about the different 
DNA glycosylases comes from studies of their substrate 
specificities and activities as conducted by biochemical 
assay of the various purified or recombinant proteins or as 
analysed in cell extracts, genetic studies can obviously give 
major insights into the functions of the different glycosylases 
in vivo. Ung~ l ~ mice are largely healthy although they show 
a deficiency in antibody diversification and an increased pro- 
pensity to tumour development after 1 year of age (25-27). 
MBD4~I~ mice show increased intestinal tumourigenesis in 
a cancer-susceptible background (28,29). TDG-deficiency 
results in embryonic lethality (30), an observation that has 
especially fuelled interest in possible TDG-dependent mech- 
anisms for genome demethylation. SMUG 1 -deficient mice 
have not so far been described although recent work with 
cell-lines suggests that SMUG1 might play a more major 
role in excision of U from U:G mismatches in mouse as 
opposed to human cells (31). 

We were especially interested in the effects of SMUG1 
deficiency since enforced expression of SMUG1 is able to 
partially reverse the mutator phenotype of Ung~ yeast (32) 
as well as rescue the antibody gene diversification defect of 
UNG-deficient B lymphocytes (33,34). This suggested that 
SMUG1 might act as a backup for UNG, possibly 
accounting for the relatively mild phenotype of UNG- 
deficient animals. 



MATERIALS AND METHODS 

Generation and breeding of Smugl~^~ mice 

Smugl~^~ mice were generated from targeted C57BL/6 
embryonic stem (ES) cells that were obtained from the 



European Conditional Mouse Mutagenesis Consortium 
(EUCOMM, project ID 23057). These ES cells had been 
targeted with a construct (35) which, on homologous 
targeting, will result in the insertion of a [3-galactosidase/ 
neo R gene-trap expression cassette into the intron 
separating the two coding exons of Smugl (Figure 1A). 
Transcription of the targeted Smugl allele is expected to 
result in the production of a severely truncated SMUG1 
protein comprising only the N-terminal 97 amino acids 
encoded in Smugl exon 1 followed by 40 amino acids 
produced by a frameshifted reading of the mouse 
engrailed 2 gene encoded by the inserted gene trap. 

Cells of the EUCOMM targeted ES cell clone 
smugl -C05 were injected into blastocysts from albino 
C57BL/6-Tyr mice to generate chimeras, which were 
then crossed with C57BL/6 mice to generate heterozygous 
and ultimately homozygous Smugl~^~ mice. A second line 
of Smugl~ ! ~ mice was established from EUCOMM clone 
smugl-B07 but showed no differences from animals estab- 
lished using clone smugl-C05 in respect of mouse health 
and deficiency in glycosylase activity. The Smugl~^~ mice 
were interbred with Ung~ ! ~ mice (25) that were kindly 
provided by Deborah Barnes and Tomas Lindahl 
(London Research Institute, CRUK, UK) as well as 
with Msh2~ l ~ mice (36) that were kindly provided by Dr 
H. te Reile through Dr K. Brown (CRUK Beatson 
Laboratories, University of Glasgow). Animal studies 
were approved by the Laboratory of Molecular Biology 
Ethical Review Committee and performed under United 
Kingdom Home Office Project License PPL80/2226. 

Embryo fibroblast cultures 

Spontaneously, immortalized cell lines were obtained 
through extended cultivation (>20 passages) in DMEM/ 
10% FBS of primary fibroblasts from day 13.5 male 
embryos of the desired genotypes. Cells were passaged at 
~70% confluence during culture and kept subconfluent 
for all experiments. For toxicity studies, 3000 cells per 
well were seeded in a 96-well plate, incubated in medium 
overnight and then exposed to the compound of interest 
for 48 h. Cell viability was determined immediately after 
medium change using Cell Counting Kit 8 (Dojindo 
Molecular Technologies). 

Staining of mouse tissues 

Freshly isolated mouse tissues were rinsed with cold PBS, 
then fixed in 4% paraformaldehyde at 4°C for 1 h. After 
3 x lOmin rinses with cold PBS, tissues were stained at 
37°C in a solution made by mixing nine parts 0.4mg/ml 
5-bromo-4-chloro-indolyl-(3-D-galactopyranoside (X-gal), 
ImM MgCl 2 in PBS with one part 0.5 M K 3 [Fe(CN) 6 ]/ 
0.5 M K 4 [Fe(CN) 6 ]. 

Analysis of DNA and RNA 

For Southern-blot analysis, DNA was extracted from 
mouse tail biopsies, digested with EcoRV and subjected 
to electrophoresis on an 0.8% agarose gel prior to transfer 
onto Hybond-XL nylon membranes (GE Healthcare). 
Membranes were then hybridized with PCR-generated 
probes specific for regions of the Smugl locus located 
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Figure 1. Targeted inactivation of the Smugl gene. (A) Map of the wild-type and targeted Smugl locus. The wild-type Smugl locus comprises two 
coding exons (El and E2) that, respectively, encode the first 97 and remaining 183 amino acids of the protein, with the latter being essential for 
catalytic activity (37). The targeted allele is predicted to contain a 7.2-kb cassette insertion into the intron separating these coding exons. This 
gene-trap cassette comprises a splice acceptor site [SA] (derived from the mouse engrailed 2 gene) followed by: (i) an IRES-driven P-galactosidase 
[|3-gal] reporter and polyadenylation signal [pA] and (ii) an aminoglycoside phosphotransferase cDNA [conferring neomycin resistance (neo)] whose 
expression is driven from a phosphoglycerate kinase promoter \pPGK\. The locations of the EcoRV restrictions sites [RV] and of the probes P L and 
P K that were used for Southern-blot analysis (and that lie to the left and right of the DNA used for gene targeting) are indicated. (B) Southern-blot 
analysis of splenocyte DNA from wild-type, homozygous targeted and heterozygous mice digested with EcoRV and hybridized with a mixture of 
probes P L and P R . Whereas the wild-type locus is predicted to give a single 27.5-kb EcoRV fragment that hybridizes with both probes, the targeted 
allele is anticipated to give a 7.7-kb band hybridizing with P L and a 19.8-kb band hybridizing with P K . (C) Northern-blot analysis of liver RNA from 
wild-type and homozygous targeted mice hybridized with a probe internal to Smugl exon 2; P-actin served as a loading control. (D) Genotypes of 
progeny of inter-heterozygote crosses as determined from tissue biopsies taken at the time of weaning. 



either up- or downstream of the homology arms used in 
the targeting construct (Figure 1 A). Primers for the down- 
stream probe were S^CCCTTTACCTTGAGCCCTTC 
and S'-TCTTGGACGTCCTTGTTTCC. Two PCR amp- 
lifications yielding products that flank a region of repeti- 
tive sequence were used for the upstream probe: S^TCAG 
TGCTGAGGGGCTAGTT^-ACATCTGCCAGCCTG 
TATCC and 5 / -GTAAACAGTTAATGGCCCTTGG/5 / - 
GCCTTAAGAAACATGTCACACA. 

For analysis of RNA from mouse liver, polyA + RNA 
was purified using RNeasy and Oligotex kits (Qiagen) ac- 
cording to the manufacturer's instructions and aliquots 
(4ug) subjected to electrophoresis on 1.2% agarose/ 
MOPS-acetate gels, transferred onto Hybond-XL mem- 
branes and hybridized with a 317-nt probe for Smugl 
exon 2 generated by PCR using primers S^GTTGGGG 
GCCCTGTGCTGAC and S^CCGCCCCACTCCCACT 
ACCA. 



DNA-glycosylase assays 

Nuclear lysates were prepared according to a modified 
version of the procedure described in (6). Snap-frozen 
tissues were thawed on ice and finely minced with a 
razorblade before passage through a 70 um nylon mesh 
cell strainer (BD Falcon). Fragments were washed twice in 
ice-cold PBS in the presence of protease inhibitors (Roche) 
and then resuspended in twice the packed cell mass of hypo- 
tonic buffer A (lOmM HEPES/KOH pH 7.7, 0.5 mM 
MgCl 2 , lOmM KC1, 1 mM DTT plus protease inhibitors). 
After 1 5-min incubation to lyse cell membranes, nuclei were 
pelleted by centrifugation (2000 g, lOmin) and incubated in 
two volumes of nuclear lysis buffer (20 mM HEPES/KOH 
pH 7.7, 0.5 mM MgCl 2 , 0.42 M NaCl, 0.2 mM EDTA, 
1 mM DTT, 25% glycerol plus protease inhibitors). After 
20-min incubation on ice, particulates were removed by cen- 
trifugation (14 000 g, lOmin) and the supernatants dialysed 
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for 6h against 25 mM HEPES/KOH pH 7.7, 50 mM KC1, 
2mM DTT through a 10-kD molecular weight cut-off 
membrane, aliquoted and snap-frozen for storage. Protein 
content was determined using a BioRad Bradford kit 
according to the manufacturer's instructions. 

Excision activities in nuclear extracts were determined by 
mixing 5 jig protein with 25 fmol oligonucleotide substrate 
and 2.5 U of APE1 (NEB) in 20 mM Tris-HCl pH 8.0, 
50 mM NaCl, 1 mM DTT, 1 mM EDTA, lOO^ig/ml BSA 
(final volume 10|il) for 1 h at 37°C. After addition of an 
equal volume of formamide, reactions were boiled for 
20min at 100°C (to enhance cleavage at abasic sites in 
single-stranded DNA substrates which are poor substrates 
for APE1) and run on 10% denaturing polyacrylamide/ 
TBE-Urea gels (Invitrogen). The oligonucleotide substrate 
and cleaved product were visualized using a Typhoon 
scanner and band intensities quantified using Image J. The 
single-stranded oligonucleotide substrates P'-fluoresc- 
einated 5'-CATAAAGTGxAAAGCCTGGA or 5'-CAT 
AAAGTGxAAGCCTGGA, where x = U, hmU or FU), 
and their complements with a centrally placed A or G 
were obtained HPLC-purified from Integrated DNA 
Technologies or the Midland Certified Reagent Company. 

RESULTS 

Mice carrying a targeted p-galactosidase insertion 
in the Smugl locus 

Embryonic stem cells of C57BL/6 origin were obtained 
that carried a targeted insertion between the two coding 
exons of Smugl. The inserted cassette comprises (from 5'- 
to Spends): an RNA splice acceptor site, an internal ribo- 
somal entry site (IRES), a promoterless P-galactosidase 
gene and a phosphoglycerate kinase (PGK) promoter- 
driven aminoglycoside phosphotransferase gene which 
confers neomycin resistance (Figure 1A). It is thus 
anticipated that the P-galactosidase gene will be expressed 
under control of the endogenous Smugl transcription 
regulatory signals but that the second exon of Smugl 
(which contains most of the SMUG1 protein-coding 
sequence) will not be transcribed into mRNA. These 
*Smwg7:Pgal/neo ES cells were injected into blastocysts 
isolated from albino C57BL/6-7jr mice and the resultant 
chimaeras bred to obtain heterozygous mice carrying 
the Smwg7:Pgal/neo insertion on one allele in their 
germline. 

Southern-blot analysis of tail DNA from Smugl:fig&\/ 
neo heterozygous mice revealed the expected change in 
Smugl -hybridizing restriction fragments (Figure IB). 
Interbreeding of the heterozygotes yielded Smugl:$ga\/ 
neo homozygotes at the expected Mendelian frequency 
with northern-blot analysis revealing that these homozy- 
gotes had indeed lost Smugl mRNA as judged by 
northern blotting with an exon 2 probe (Figure 1C and 
D). As indicated under 'Materials and Methods' section, 
the nature of the gene targeting might allow the produc- 
tion of a polypeptide including the first 97 amino acids of 
SMUGL However, not only would such a polypeptide 
lack essential catalytic residues of SMUG1, experiments 
designed to detect a possible dominant inhibitory effect of 



liver nuclear extracts from homozygous Smugl:$gal/neo 
mice in base excision repair assays proved negative 
(Supplementary Figure 1). Thus, SmwgiiPgal/neo homo- 
zygous mice are viable, show no obvious impairment in 
health or fertility with any phenotype likely attributable to 
the loss of Smugl gene expression. We designate these 
animals Smugl~'~ mice. 

Targeted Smugl inactivation results in loss 
of hmU-excision activity 

Recombinant SMUG1 has been shown not only to be 
active in excising U from DNA, but also active on 
several pyrimidines that carry substitutions at the C5 
position such as hmU, 5-formyluracil, 5-hydroxyuracil 
and 5-fluorouracil (8,37,38). Analysis of both brain and 
liver cell extracts revealed that Smugl inactivation results 
in loss of nearly all detectable hmU-excision activity 
(without notably impairing the total U-excision activity 
in the extracts). A similar ablation of hmU-excision 
activity was noted in all other tissues analysed (data not 
shown). The results indicate that SMUG1 is the major if 
not sole enzyme responsible for hmU excision in the 
mouse (Figure 2A). 

Smugl -1- embryo fibroblasts are resistant 
to 5-hydroxymethyldeoxyuridine toxicity 

HmU can arise in DNA not only as a result of modifica- 
tion of one of the bases naturally present in DNA but also 
from misincorporation of 5-hydroxymethyldeoxyuridine 
(hmUdR) during DNA synthesis. Indeed, hmUdR is 
toxic to cells (39) with studies in CHO cells having 
revealed that this toxicity is likely due to excessive 
excision of misincorporated hmUdR (40,41). To test 
whether SMUG1 is responsible for hmUdR toxicity in 
mouse cells, we established embryo fibroblast lines from 
Smugl~ ! ~ as well as SMUG 1 -proficient control embryos 
and tested their sensitivity to culture with varying concen- 
trations of hmUdR. The results (Figure 2B) show that 
deficiency in SMUG1 is indeed sufficient to overcome 
sensitivity to hmUdR, consistent with SMUG1 being the 
dominant if not sole glycosylase involved in hmUdR 
excision from genomic DNA in mice. Curiously, the 
UNG-deficient cells appear slightly more sensitive to 
hmUdR than the wild-type controls. We do not know 
the basis for this increased sensitivity. It does not seem 
to be due to a compensatory increase in Smugl expression 
since RT-PCR analysis reveals no major perturbation in 
Smugl RNA levels (data not shown) but it might 
conceivably reflect an increased efficacy with which 
SMUG1 itself acts on the hmU in DNA or with which 
the abasic site is processed in the absence of UNG. 

Although SMUG1 is therefore responsible for making 
mouse cells sensitive to being cultured in the presence of 
hmUdR, a more likely physiological function of SMUG1 
is to mediate the repair of hmU that arises in DNA 
through either the oxidation of T or the spontaneous 
deamination of hmC. Indeed, recent data have focused 
on hmC as a component of mammalian genomic DNA 
with indications that it is especially abundant in brain 
(42-44). However, SMUG 1 -deficiency does not lead to 
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Figure 2. Smugl inactivation ablates hmU excision activity and gives 
resistance to hmUdR. (A) Analysis of base excision activity in brain 
and liver nuclear extracts from wild-type (WT), Ung~^~ and Smugl~^~ 
mice assayed on a 3'-fluorescently labelled double-stranded oligodeox- 
yribonucleotide substrate containing a centrally placed hmU:G mispair 
(left), as well as on a single-stranded 3'-fLuorescently labelled substrate 
containing a single centrally placed U as an UNG activity control. 
Excision creates an abasic site that is subject to cleavage by APE1; 
cleavage products of the 3'-fluorescently labelled oligonucleotides 
were visualized after polyacrylamide gel electrophoresis. Similar 
results regarding a loss of hmU-excision activity were obtained on 
assaying multiple different tissues from Smugl~^~ mice of different 
ages. (B) Effects of a 48-h culture in the presence of various concen- 
trations of hmUdR on the viability of embryo fibroblast cell-lines 
derived from mice of different genotypes. Survival is expressed 
relative to that obtained in the absence of hmUdR. The results are 
shown for triplicate assays performed on two independently derived 
cell-lines for each indicated genotype. 



any gross abnormalities in the brain as judged by crude 
histological examination (Figure 3A and B), even though 
in keeping with earlier studies (45) we find that brain 
shows a significantly higher ratio of hmU to U excision 
activity as compared to extracts from other tissues tested 
(Figure 3C). 

Lack of detectable U-excision activity 
in Ung~l~ Smugl -1- double knockouts 

Mice deficient in the UNG uracil-DNA glycosylase retain 
residual U excision activity as judged by biochemical assay 
of tissue extracts. Antibody inhibition experiments as well 
as analysis of fibroblast lines carrying a Smugl knockdown 
expression construct have indicated that SMUG1 is 



responsible for much of this backup activity (6,25,38,46). 
Comparison of tissue extracts from mice carrying homozy- 
gous disruptions of either Ung alone, Smugl alone or both 
Ung and Smugl reveals that residual U-excision activity as 
judged on a double-stranded oligodeoxyribonucleotide 
substrate containing a U:G mismatch is still evident in 
tissues from Ung -1- mice but that this residual activity is 
essentially absent in extracts from Ung -1- Smugl -I ~ 
double knockouts (Figure 4A). Thus, in all the tissues 
examined, SMUG1 is the major UNG backup for 
excising U from U:G mismatches — indeed, the sole detect- 
able backup within the sensitivity of these assays. Under the 
conditions in which these assays were performed, there is 
little evidence of any UNG-backup activity when a 
single-stranded U-containing oligonucleotide was used as 
substrate (Figure 4B), consistent with previous observa- 
tions (6) that SMUG1 exhibits much less activity on 
single- as opposed to double-stranded DNA substrates. 

Fluorouracil excision activity in Smugl -1- 
and Ung~ f ~ Smugl~ f ~ mice 

5-Fluorouracil (FU) is a base analogue that has found 
widespread use in cancer treatment and can be 
incorporated into both RNA and DNA. Both UNG and 
SMUG1 are known to be able to excise FU from 
double-stranded oligonucleotides (46-48) and assays of 
liver nuclear extracts from knockout mice indicate that 
inactivation of either UNG or SMUG1 is sufficient to 
reduce the overall excision activity detected on an FU:G 
substrate (Figure 4C). However, inactivation of both Ung 
and Smugl does not give an additive reduction in excision 
activity with residual FU:G excision activity still being 
detected in Smugl~^~Ung~^~ extracts. The results there- 
fore indicate a contribution from at least one other 
glycosylase, with TDG being a possible candidate 
(48,49). When assayed on a single-stranded substrate, 
UNG appears to be the dominant enzyme responsible 
for FU excision, consistent with observations made previ- 
ously by Pettersen et al. (50). It is nevertheless notable that 
deficiency in either UNG or SMUG1 has little effect on 
the sensitivity of cells to FU (Figure 4D), consistent with 
the proposal that FU toxicity is largely due to its incorp- 
oration into RNA, rather than DNA (50). 

Longevity of Smugl -1- , Ung~^~ Smugl~^~ 
and Ung~ f ~ Smugl~ f ~ Msh2~ l ~ mice 

Not only were Smugl~ l ~ mice born at the expected 
Mendelian ratio from Smugl +I ~ intercrosses, we have also 
not detected any effects of SMUG 1 -deficiency on mouse 
health or survival as monitored out beyond 1 year of age 
(Figure 5). Strikingly, despite the lack of detectable 
uracil-excision activity in their tissues, Ung~ j ~ Smugl~ j ~ 
double knockout mice were also born at the expected fre- 
quency from inter-heterozygote crosses and appear healthy. 
Out of 32 Ung~!~ Smugl~'~ mice followed for > 1 year, only 
one has died so far, with the cause of death likely being un- 
related to the genetic defect (fight wounds). This is consistent 
with the observations of Nilsen et al. (27) who found that a 
modest increase in morbidity of Ung -1- mice only became 
manifest at > 1 8 months of age. Thus, it is evident that mouse 
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Figure 3. Activity of the Smugl locus in different tissues as judged by |3-gal reporter and hmU-excision activity. (A and B) X-gal staining of day 13.5 
embryos and of sagittal and coronal sections of adult mouse brains from wild-type and Smugl~^~ mice (which carry the |3-gal/neo gene-trap 
expression cassette in the Smugl locus, driven by the endogenous Smugl promoter) as well as from wild-type (WT) controls, arb, arbor vitae; 
CA, central amygdalar nuclei; CBX, cerebellar cortex; CP, caudoputamen; CTX, cerebral cortex; DN, dentate nucleus; HPF, hippocampus; HY, 
hypothalamus; MY, myelencephalon; opt, optical tract; PSV, principal sensory nucleus of the trigeminal; SN, substantia nigra; SP, pyramidal layer; 
SPVC, spinal nucleus of the trigeminal (caudal part); TH, thalamus. (C) Comparison of the activity of nuclear extracts from various tissues on 
double-stranded oligodeoxyribonucleotide substrates containing central hmU:G and U:G mismatches. Recombinant SMUG1 (rSMUG; New 
England Biolabs) served as control. On the right hand side, the excision activity on hmU:G relative to that on U:G is shown for different 
tissues. Activities were determined as the product band intensity divided by the integrated total band intensity in each assay and is based on the 
analysis of tissues from three mice. 



health and survival up to 1 year of age is not severely 
compromised by a lack of detectable uracil excision activity. 

We wondered if the apparent health of the double 
knockout animals could reflect the fact that the U:G 
mispairs generated through spontaneous C deamination 
might, in the absence of UNG and SMUG1, be corrected 
by mismatch repair. We therefore asked whether it was 
possible to generate mice that simultaneously lack UNG, 
SMUG1 and MSH2 (with MSH2 being essential for 
mismatch repair). We found that such mice could be 
generated although the reduced fertility of Msh2~ l ~ mice 
and the complexity of the breedings mean that we have so 
far generated only six triple knockout mice. The triple 
knockout animals were obtained at somewhat below the 
expected frequency since, out of the 1 59 animals screened 
(which had been weaned from 55 litters arising from 21 
different breeding pairs), we would have anticipated ob- 
taining 10.8 triple knockouts on a Mendelian basis given 
the nature of the genotypes used for the various crosses. 

Msh2~ l ~ mice are known to exhibit greatly reduced 
survival with a mean longevity of ~6 months (51). 



Although we have only so far obtained six Ung~'~ 
Smugl -1- Msh2~ l ~ triple knockout mice, they show a 
greatly reduced longevity with a mean life expectancy of 
107 days (as compared to 159 days for the Ung~^~ Msh2~ l 
~, 238 days for the Smugl~ ! ~ Msh2~ ! ~ mice and 223 days 
for the Msh2~ l ~ mice). It therefore appears that combined 
deficiency in uracil excision and mismatch repair results in a 
more severe phenotype than mismatch repair deficiency 
alone (P < 0.01). Indeed, not only is the reduced mean life 
expectancy of the triple knockout mice statistically highly 
significant, it is also most striking that all deaths in the triple 
knockout mice occurred before 152 days of age whereas the 
mean life expectancy of all the other single/double knockout 
strains analysed is >157 days. Similar to what has been 
noted with Msh2~ ! ~ mice (36,52), the triple knockout 
mice that have died so far have all died from tumours, 
which were lymphoid tumours in four of the five cases. 

The results therefore show that it is possible to generate 
mice that are deficient in both uracil-excision repair and 
mismatch repair but that such mice show severely 
compromised survival that appears to be largely due to 
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Figure 4. SMUG1 is the major UNG-backup for U excision from 
U:G-containing double- stranded oligodeoxyribonucleotides. (A) Assay 
of nuclear extracts of tissues from wild-type (WT), Smugl~^~ (Sm~), 
Ung~^~ (Un~) and Smugl~^~Ung~^~ (Sm~ Un~) mice for excision 
activity on a double-stranded oligodeoxyribonucleotide substrate (S) 
containing a centrally placed U:G mispair. P denotes the product fol- 
lowing cleavage. Recombinant UDG (rUDG; NEB) served as a 
positive control. Representative results are shown for different tissues 
from one set of mice as well as from livers of four other sets. (B) Assay 
of nuclear extracts for excision activity on a single-stranded oligodeox- 
yribonucleotide containing a single centrally placed U. (Similar results 
were obtained if samples were boiled under alkaline conditions confirm- 
ing that the weak signals obtained on single-stranded DNA substrates 
do not reflect incomplete strand cleavage at abasic sites.) (C) Assay of 
liver nuclear extracts for excision activity on a double-stranded 
oligodeoxyribonucleotide containing a centrally placed FU:G mispair. 
The result presented is representative of three such analyses. (D) Effects 
of a 48-h culture in the presence of various concentrations of FU on 
the viability of embryo fibroblast cell-lines obtained from mice of dif- 
ferent genotypes {WT, filled circles; Smugl~ ! ~, open circles; Ung~^~, 
filled triangles). Survival is expressed relative to that obtained in the 
absence of FU. The results are shown as the average of triplicate assays 
performed on embryonic fibroblast cell-lines derived from mice of each 
indicated genotype. 



tumours and that the compromised survival of these 
animals seems somewhat more severe than is attributable 
to MSH2-deficiency alone. 



DISCUSSION 

The results described here reveal that ablation of the 
Smugl gene in mice removes essentially all detectable 
hmU-excision activity as well as the residual U-excision 
activity that is observed in tissue extracts of Ung~^~ mice. 
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Figure 5. Effects of deficiency in SMUG1, UNG and MSH2 on mouse 
longevity. Kaplan-Meyer survival curves for mice of different genotypes as 
monitored up to >1 year of age. Data for Smugl~^~ (n = 39), Smugl~^~ 
Ung~^~ (n = 32) and Smugl~^~ Ung~^~ Msh2~ l ~ (n = 6) mice is based on a 
prospective study of cohorts of mice that were left to age; data for Msh2~ l ~ 
(n = 47) and Ung~^~ Msh2~ ! ~ (n = 42) is derived from retrospective 
analysis of all mice in the colony. In the prospective cohort studies, each 
symbol indicates a death whereas in the retrospective analyses each symbol 
indicates the end of observations for one animal with the survival line only 
coming down if this observation endpoint was due to a disease-related 
death rather than caused by a healthy animal being removed from the 
colony (e.g. for use in experimentation). The median life expectancy, 
determined from retrospective analysis, was >420 days for Smugl~ l ~ 
and >510 days for Smugl~^~ Ung~^~ mice (the cohorts having so far 
only been followed for those numbers of days with too few mice having 
died to give a median life expectancy), 223 days for Msh2~ l ~, 159 days for 
Ung~^~ Msh2~ l ~ and 107 days for Smugl -1- Ung~^~ Msh2~ l ~ mice. 



That SMUG1 accounts for the residual uracil-excision 
observed in tissue extracts from Ung~^~ mice when 
assayed on oligodeoxyribonucleotide substrates contain- 
ing a U:G mismatch is consistent with previous depletion 
and inhibition studies (6,25,38,46). It is, however, striking 
that Smugl~^~ Ung~^~ double knockout mice in which all 
such readily detectable uracil-excision activity has been 
ablated can nevertheless be easily generated and are 
largely healthy up till at least 1 year of age. This indicates 
that the lesions generated by spontaneous C and hmC 
deamination in these animals must either be dealt with 
by another pathway or that these do not occur at a fre- 
quency that jeopardizes mouse development and viability. 

It is conceivable that the U or hmU residues generated 
by C or mC deamination, respectively, could be repaired 
by TDG, since in vitro studies have shown that recombin- 
ant TDG can act on both U:G and hmU:G mismatches 
(53). It is difficult to test whether TDG actually acts as an 
essential backup in repairing U:G lesions in vivo since 
deficiency in TDG alone is embryonically lethal (30). 
However, it is notable that not only do we not readily 
detect any back up uracil excision activity in extracts 
from Smugl~^~ Ung~^~ mice (which could obviously 
reflect the sensitivity of our in vitro detection), but we 
have previously observed that enforced overexpression 
of TDG in mouse B cells is not sufficient to process 
AID-generated U:G lesions into class-switch recombin- 
ation, whereas both UNG and SMUG1 function 
effectively in such assays (34). Thus, although TDG may 
well act on T:G mismatches that result from deamination 
at mCpG dinucleotides in vivo, there must be some 
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question as to whether TDG plays a major role in the 
general in vivo repair of genomic U:G lesions. 

A likely backup pathway for the repair of U:G mis- 
matches is provided by mismatch repair. Indeed, studies of 
antibody diversification provide good evidence that MSH2/ 
MSH6 does indeed recognize and process U:G mismatches 
in vivo (13). We find in this work that Smugl~^~ Ung~^~ 
Msh2~ l ~ triple knockout mice are viable though, like 
Msh2~ l ~ single knockouts, they are cancer-prone. There is 
a clear indication that the cancer predisposition of Msh2~ l ~ 
mice is exacerbated if the animals are also rendered deficient 
in UNG and SMUG1. These results suggest that when both 
base excision and mismatch repair pathways are blocked, 
spontaneous cytosine deamination occurs at a frequency 
sufficient to increase cancer incidence but not to preclude 
mouse development. 

The presence of an hmU-excision activity in mamma- 
lian cell extracts was noted >20 years ago (45,54) with this 
activity being found to be especially effective on an 
hmU:G mismatch (55). Subsequent purification of the pre- 
dominant hmU-excision activity attributed it to SMUG1 
(56) although recombinant TDG has also been shown to 
be able to excise hmU from DNA in vitro (24,53). Our 
results with Smugl~^~ mice not only confirm that 
SMUG1 accounts for essentially all the hmU-excision 
activity detectable in mouse tissue extracts as judged by 
biochemical assay on an hmU:G double-stranded oligo- 
nucleotide substrate, they also show that SMUG1 is 
responsible for the sensitivity of mouse cells to culturing 
with hmUdR — analogously to what has been observed 
with a mutant Syrian hamster cell line (40). 

It was speculated several years ago on the basis of 
phylogenetic evidence that hmU-DNA glycosylase might 
function in the maintenance of 5-methylcytosine in 
genomic DNA (57). Interest in this possibility has 
recently gained a considerable boost in light of findings 
concerning hmC as a natural component of genomic DNA 
in mammals (42,43,58) as well as from suggestions that 
oxidation of mC —> hmC followed by deamination of 
hmC —> hmU and subsequent hmU excision might consti- 
tute physiological steps in the demethylation of mC 
(23,24). Although the viability, fertility and apparent 
health of SMUG 1 -deficient mice do not wholly exclude 
a role for hmU excision in the physiological demethylation 
of mC, the results do suggest that hmU excision is not 
essential for the process. Given the presence of hmC as 
a natural component of mammalian DNA, it may simply 
be that SMUG1 predominantly evolved to take care of the 
hmU arising out of spontaneous hmC deamination, as 
well as from thymine oxidation. 
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